Visual information retrieval from historical document images
نویسندگان
چکیده
منابع مشابه
Layout Based Information Retrieval from Document Images
This research is intended to develop a layout based retrieval system for document image databases consisting of three phases: 1. At first, intelligent layout analysis algorithm has been designed to extract the layouts the document images physically with their edges and rectangles. 2. Every physically identified layout has been converted into a tree intermediary representation for indexing and s...
متن کاملDocument retrieval from compressed images
With the emergence of digital libraries, more and more documents are stored and transmitted through the Internet in the format of compressed images. It is of signi/cant meaning to develop a system which is capable of retrieving documents from these compressed document images. Aiming at the popular compression standard-CCITT Group 4 which is widely used for compressing document images, we presen...
متن کاملSegmentation-Free Keyword Retrieval in Historical Document Images
We present a segmentation-free method to retrieve keywords from degraded historical documents. The proposed method works directly on the gray scale representation and does not require any pre-processing to enhance document images. The document images are subdivided into overlapping patches of varying sizes, where each patch is described by the bag-of-visual-words descriptor. The obtained patch ...
متن کاملAutomatic Keyword Extraction from Historical Document Images
This paper presents an automatic keyword extraction method from historical document images. The proposed method is language independent because it is purely appearance based, where neither lexical information nor any other statistical language models are required. Moreover, since it does not need word segmentation, it can be applied to Eastern languages where they do not put clear spacing betwe...
متن کاملInformation Retrieval from Historical Corpora
With the increasing number of documents that are available in digital form, also the number of digital historical documents is increasing (Berkvens, 2001). It cannot be assumed that standard IR systems perform well on historical documents: historical texts differ from modern texts in three ways (Hüning, 1996; Van Der Horst and Marschall, 1989): (a) vocabularies have changed, (b) spelling has ch...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Cultural Heritage
سال: 2019
ISSN: 1296-2074
DOI: 10.1016/j.culher.2019.05.018